Lessons from deep learning

One consistent finding in Deep learning is that it is better to let Neural networks learn what’s important rather than tell them. The end-to-end learning works better when data is plenty. A good example is the Transformer model with Attention mechanism - Attnention is all you need